SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Horvitz, Robert 
Yuan, Junying 
5 Shaham, Shai 

(ii) TITLE OF THE INVENTION: RELATEDNESS OF HUMAN INTERLEUKIN - 1 

BETA CONVERTASE GENE TO A C. ELEGANS CELL DEATH 
GENE, INHIBITORY PORTIONS OF THESE GENES AND... 

(iii) NUMBER OF SEQUENCES: 27 

10 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Clark & Elbing LLP 

(B) STREET: 176 Federal Street 

(C) CITY: Boston 

(D) STATE: MA 
15 (E) COUNTRY: USA 

(F) ZIP: 02110 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

2 0 (C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 21-MAY-1995 
25 (C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/394,189 

(B) FILING DATE: 24 -FEB- 1995 

(A) APPLICATION NUMBER: 08/282,211 

3 0 (B) FILING DATE: 12 -JUL- 1994 

(A) APPLICATION NUMBER: 07/984,182 

(B) FILING DATE: 20-NOV-1992 

(A) APPLICATION NUMBER: 07/897,788 

(B) FILING DATE: 12-JUN-1992 

35 (viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Bieker-Brady, Kristina 

(B) REGISTRATION NUMBER: 39,109 
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(C) REFERENCE /DOCKET NUMBER: 01997/211002 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-428-0200 

(B) TELEFAX: 617-428-7045 
5 (C) TELEX: 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7653 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

D AGATCTGAAA TAAGGTGATA AATTAATAAA TTAAGTGTAT TTCTGAGGAA ATTTGACTGT 60 

15 TTTAGCACAA TTAATCTTGT TTCAGAAAAA AAGTCCAGTT TTCTAGATTT TTCCGTCTTA 120 

ffl TTGTCGAATT AATATCCCTA TTATCACTTT TTCATGCTCA TCCTCGAGCG GCACGTCCTC 180 

CO AAAGAATTGT GAGAGCAAAC GCGCTCCCAT TGACCTCCAC ACTCAGCCGC CAAAACAAAC 240 

W GTTCGAACAT TCGTGTGTTG TGCTCCTTTT CCGTTATCTT GCAGTCATCT TTTGTCGTTT 300 

ffj TTTTCTTTGT TCTTTTTGTT GAACGTGTTG CTAAGCAATT ATTACATCAA TTGAAGAAAA 360 

=p 20 GGCTCGCCGA TTTATTGTTG CCAGAAAGAT TCTGAGATTC TCGAAGTCGA TTTTATAATA 420 

fjj TTTAACCTTG GTTTTTGCAT TGTTTCGTTT AAAAAAACCA CTGTTTATGT GAAAAACGAT 480 

H TAGTTTACTA ATAAAACTAC TTTTAAACCT TTACCTTTAC CTCACCGCTC : CGTGTTCATG 540 

p GCTCATAGAT TTTCGATACT CAAATCCAAA AATAAATTTA CGAGGGCAAT TAATGTGAAA 600 

fll CAAAAACAAT CCTAAGATTT CCACATGTTT GACCTCTCCG GCACCTTCTT CCTTAGCCCC 660 

fU 25 ACCACTCCAT CACCTCTTTG GCGGTGTTCT TCGAAACCCA CTTAGGAAAG CAGTGTGTAT 72 0 

ffj CTCATTTGGT ATGCTCTTTT CGATTTTATA GCTCTTTGTC GCAATTTCAA TGCTTTAAAC 780 

AATCCAAATC GCATTATATT TGTGCATGGA GGCAAATGAC GGGGTTGGAA TCTTAGATGA 840 

^ GATCAGGAGC TTTCAGGGTA AACGCCCGGT TCATTTTGTA CCACATTTCA TCATTTTCCT 900 

GTCGTCCTTG GTATCCTCAA CTTGTCCCGG TTTTGTTTTC GGTACACTCT TCCGTGATGC 960 

3 0 CACCTGTCTC CGTCTCAATT ATCGTTTAGA AATGTGAACT GTCCAGATGG GTGACTCATA 1020 

TTGCTGCTGC TACAATCCAC TTTCTTTTCT CATCGGCAGT CTTACGAGCC CATCATAAAC 1080 

TTTTTTTTCC GCGAAATTTG CAATAAACCG GCCAAAAACT TTCTCCAAAT TGTTACGCAA 1140 

TATATACAAT CCATAAGAAT ATCTTCTCAA TGTTTATGAT TTCTTCGCAG CACTTTCTCT 1200 

TCGTGTGCTA ACATCTTATT TTTATAATAT TTCCGCTAAA ATTCCGATTT TTGAGTATTA 1260 

35 ATTTATCGTA AAATTATCAT AATAGCACCG AAAACTACTA AAAATGGTAA AAGCTCCTTT 1320 

TAAATCGGCT CGACATTATC GTATTAAGGA ATCACAAAAT TCTGAGAATG CGTACTGCGC 1380 

AACATATTTG ACGGCAAAAT ATCTCGTAGC GAAAACTACA GTAATTCTTT AAATGACTAC 1440 

TGTAGCGCTT GTGTCGATTT ACGGGCTCAA TTTTTGAAAA TAATTTTTTT TTTCGAATTT 1500 

TGATAACCCG TAAATCGTCA CAACGCTACA GTAGTCATTT AAAGGATTAC TGTAGTTCTA 1560 

40 GCTACGAGAT ATTTTGCGCG CCAAATATGA CTGTAATACG CATTCTCTGA ATTTTGTGTT 1620 

TCCGTAATAA TTTCACAAGA TTTTGGCATT CCACTTTAAA GGCGCACAGG ATTTATTCCA 1680 

ATGGGTCTCG GCACGCAAAA AGTTTGATAG ACTTTTAAAT TCTCCTTGCA TTTTTAATTC 1740 

AATTACTAAA ATTTTCGTGA ATTTTTCTGT TAAAATTTTT AAAATCAGTT TTCTAATATT 1800 

TTCCAGGCTG ACAAACAGAA ACAAAAACAC AACAAACATT TTAAAAATCA GTTTTCAAAT 1860 
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TAAAAATAAC GATTTCTCAT TGAAAATTGT GTTTTATGTT TGCGAAAATA AAAGAGAACT 1920 

GATTCAAAAC AATTTTAACA AAAAAAAACC CCAAAATTCG CCAGAAATCA AGATAAAAAA 1980 

TTCAAGAGGG TCAAAATTTT CCGATTTTAC TGACTTTCAC CTTTTTTTTC GTAGTTCAGT 2040 

GCAGTTGTTG GAGTTTTTGA CGAAAACTAG GAAAAAAATC GATAAAAATT ACTCAAATCG 2100 

5 AGCTGAATTT TGAGGACAAT GTTTAAAAAA AAACACTATT TTTCCAATAA TTTCACTCAT 2160 

TTTCAGACTA AATCGAAAAT CAAATCGTAC TCTGACTACG GGTCAGTAGA GAGGTCAACC 2220 

ATCAGCCGAA GATGATGCGT CAAGATAGAA GGAGCTTGCT AGAGAGGAAC ATTATGATGT 22 80 

TCTCTAGTCA TCTAAAAGTC GATGAAATTY TCGAAGTTCT CATCGCAAAA CAAGTGTTGA 234 0 

ATAGTGATAA TGGAGATATG ATTAATGTGA GTTTTTAATC GAATAATAAT TTTAAAAAAA 24 00 

10 AATTGATAAT ATAAAGAATA TTTTTGCAGT CATGTGGAAC GGTTCGCGAG AAGAGACGGG .2460 

AGATCGTGAA AGCAGTGCAA CGACGGRGAG ATGTGGCGTT CGACGCGTTT TATGATGCTC 2520 

TTCGCTCTAC GGGACACGAA GGACTTGCTG AAGTTCTTGA ACCTCTCGCC AGATCGTAGG 25 80 

TTTTTAAAGT TCGGCGCAAA AGCAAGGGTC TCACGGAAAA AAGAGGCGGA TCGTAATTTT 2640 

GCAACCCACC GGCACGGTTT TTTCCTCCGA AAATCGGAAA TTATGCACTT TCCCAAATAT 2700 

15 TTGAAGTGAA ATATATTTTA TTTACTGAAA GCTCGAGTGA TTATTTATTT TTTAACACTA 2760 

ATTTTCGTGG CGCAAAAGGC CATTTTGTAG ATTTGCCGAA AATACTTGTC ACACACACAC 2820 

ACACACATCT CCTTCAAATA TCCCTTTTTC CAGTGTTGAC TCGAATGCTG TCGAATTCGA 2 880 

GTGTCCAATG TCACCGGCAA GCCATCGTCG GAGCCGCGCA TTGAGCCCCG CCGGCTACAC 2940 

_ TTCACCGACC CGAGTTCACC GTGACAGCGT CTCTTCAGTG TCATCATTCA CTTCTTATCA 3000 

O 20 GGATATCTAC TCAAGAGCAA GATCTCGTTC TCGATCGCGT GCACTTCATT CATCGGATCG 3060 

*S ACACAATTAT TCATCTCCTC CAGTCAACGC ATTTCCCAGC CAACCTTGTA TGTTGATGCG 3120 

Kl AACACTAAAT TCTGAGAATG CGCATTACTC AACATATTTG ACGCGCAAAT ATCTCGTAGC 3180 

03 GAAAAATACA GTAACCCTTT AAATGACTAT TGTAGTGTCG ATTTACGGGC TCGATTTTCG 3240 

W AAACGAATAT ATGCTCGAAT TGTGACAACG AATTTTAATT TGTCATTTTT GTGTTTTCTT 33 00 

til 25 TTGATATTTT TGATCAATTA ATAAATTATT TCCGTAAACA GACACCAGCG CTACAGTACT 3360 

45 CTTTTAAAGA GTTACAGTAG TTTTCGCTTC AAGATATTTT GAAAAGAATT TTAAACATTT 3420 

y TGAAAAAAAA TCATCTAACA TGTGCCAAAA CGCTTTTTTC AAGTTTCGCA GATTTTTTGA 34 80 

s TTTTTTTCAT TCAAGATATG CTTATTAACA CATATAATTA TCATTAATGT GAATTTCTTG 3540 

O TAGAAATTTT GGGCTTTTCG TTCTAGTATG CTCTACTTTT GAAATTGCTC AACGAAAAAA 3600 

S 30 TCATGTGGTT TGTTCATATG AATGACGAAA AATAGCAATT TTTTATATAT TTTCCCCTAT 3660 

fy TCATGTTGTG CAGAAAAATA GTAAAAAAGC GCATGCATTT TTCGACATTT TTTACATCGA 3720 

ffj ACGACAGCTC ACTTCACATG CTGAAGACGA GAGACGCGGA GAAATACCAC ACATCTTTCT 3780 

GCGTCTCTCG TCTTCAGCAT GTGAAATGGG ATCTCGGTCG ATGTAAAAAA ATGTCGAATA 3840 

L& ATGTAAAAAA TGCATGCGTT TTTTTACACT TTTCTGCACA AATGAATAGG GGGAAAATGT 3900 

' 35 ATTAAAATAC ATTTTTTGTA TTTTTCAACA TCACATGATT AACCCCATTA TTTTTTCGTT 3960 

GAGCAACTTA AAAAGTAGAG AATATTAGAG CGAAAACCAA AATTTCTTCA AGATATTACC 4020 

TTTATTGATA ATTATAGATG TTAATAAGCA TATCTTGAAT GAAAGTCAGC AAAAATATGT 4080 

GCGAAACACC TGAAAAAAAT CAAAAATTCT GCGAAAATTG AAAAAATGCA TTAAAATACA 4140 

TTTTTGCATT TTTCTACATC ACATGAATGT AGAAAATTAA AAGGGAAATC AAAATTTCTA 42 00 

40 GAGGATATAA TTGAATGAAA CATTGCGAAA TTAAAATGTG CGAAACGTCA AAAAAGAGGA 4260 

AATTTGGGTA TCAAAATCGA TCCTAAAACC AACACATTTC AGCATCCGCC AACTCTTCAT 432 0 

TCACCGGATG CTCTTCTCTC GGATACAGTT CAAGTCGTAA TCGCTCATTC AGCAAAGCTT 4380 

CTGGACCAAC TCAATACATA TTCCATGAAG AGGATATGAA CTTTGTCGAT GCACCAACCA 4440 

TAAGCCGTGT TTTCGACGAG AAAACCATGT ACAGAAACTT CTCGAGTCCT CGTGGAATGT 4500 

45 GCCTCATCAT AAATAATGAA CACTTTGAGC AGATGCCAAC ACGGAATGGT ACCAAGGCCG 4560 

ACAAGGACAA TCTTACCAAT TTGTTCAGAT GCATGGGCTA TACGGTTATT TGCAAGGACA 4620 

ATCTGACGGG AAGGGTACGG CGAAATTATA TTACCCAAAC GCGAAATTTG CCATTTTGCG 4680 

CCGAAAATGT GGCGCCCGGT CTCGACACGA CAATTTGTGT TAAATGCAAA AATGTATAAT 4740 

TTTGCAAAAA ACAAAATTTT GAACTTCCGC GAAAATGATT TACCTAGTTT CGAAATTTTC 4800 

50 GTTTTTTCCG GCTACATTAT GTGTTTTTTC TTAGTTTTTC TATAATATTT GATGTAAAAA 4860 

ACCGTTTGTA AATTTTCAGA CAATTTTCCG CATACAAAAC TTGATAGCAC GAAATCAATT 4920 
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TTCTGAATTT 


TCAAAATTAT 


CCAAAAATGC 


ACAATTTAAA 


ATTTGTGAAA 


ATTGGCAAAC 


4980 






GGTGTTTCAA 


TATGAAATGT 


ATTTTTAAAA 


ACTTTAAAAA 


CCACTCCGGA 


AAAGCAATAA 


5040 






AAATCAAAAC 


AACGTCACAA 


TTCAAATTCA 


AAAGTTATTC 


ATCCGATTTG 


TTTATTTTTG 


5100 






CAAAATTTGA 


AAAAATCATG 


AAGGATTTAG 


AAAAGTTTTA 


TAACATTTTT 


TCTAGATTTT 


5160 




5 


TCAAAATTTT 


TTTTAACAAA 


TCGAGAAAAA 


GAGAATGAAA 


AATCGATTTT 


AAAAATATCC 


5220 






ACAGCTTCGA 


GAGTTTGAAA 


TTACAGTACT 


CCTTAAAGGC 


GCACACCCCA 


TTTGCATTGG 


5280 






ACCAAAAATT 


TGTCGTGTCG 


AGACCAGGTA 


CCGTAGTTTT 


TGTCGCAAAA 


ATTGCACCAT 


5340 






TGGACAATAA 


ACCTTCCTAA 


TCACCAAAAA 


GTAAAATTGA 


AATCTTCGAA 


AAGCCAAAAA 


5400 






ATTCAAAAAA 


AAAGTCGAAT 


TTCGATTTTT 


TTTTTGGTTT 


TTTGGTCCCA 


AAAACCAAAA 


5460 




10 


AAATCAATTT 


TCTGCAAAAT 


ACCAAAAAGA 


AACCCGAAAA 


AATTTCCCAG 


CCTTGTTCCT 


. 5520 






AATGTAAACT 


GATATTTAAT 


TTCCAGGGAA 


TGCTCCTGAC 


AATTCGAGAC 


TTTGCCAAAC 


5580 






ACGAATCACA 


CGGAGATTCT 


GCGATACTCG 


TGATTCTATC 


ACACGGAGAA 


GAGAATGTGA 


5640 






TTATTGGAGT 


TGATGATATA 


CCGATTAGTA 


CACACGAGAT 


ATATGATCTT 


CTCAACGCGG 


5700 






CAAATGCTCC 


CCGTCTGGCG 


AATAAGCCGA 


AAATCGTTTT 


TGTGCAGGCT 


TGTCGARGCG 


5760 




15 


GTTCGTTTTT 


TATTTTAATT 


TTAATATAAA 


TATTTTAAAT 


AAATTCATTT 


TCAGAACGTC 


5820 






GTGACAATGG 


ATTCCCAGTC 


TTGGATTCTG 


TCGACGGAGT 


TCCTGCATTT 


CTTCGTCGTG 


5880 






GATGGGACAA 


TCGAGACGGG 


CCATTGTTCA 


ATTTTCTTGG 


ATGTGTGCGG 


CCGCAAGTTY 


5940 






AGGTTGCAAT 


TTAATTTCTT 


GAATGAGAAT 


ATTCCTTCAA 


AAAATCTAAA 


ATAGATTTTT 


6000 






ATTCCAGAAA 


GTCCCGATCG 


AAAAATTGCG 


ATATAATTAC 


GAAATTTGTG 


ATAAAATGAC 


6060 




20 


AAACCAATCA 


GCATCGTCGA 


TCTCCGCCCA 


CTTCATCGGA 


TTGGTTTGAA 


AGTGGGCGGA 


6120 






GTGAATTGCT 


GATTGGTCGC 


AGTTTTCAGT 


TTAGAGGGAA 


TTTAAAAATC 


GCCTTTTCGA 


6180 


. 




AAATTAAAAA 


TTGATTTTTT 


CAATTTTTTC 


GAAAAATATT 


CCGATTATTT 


TATATTCTTT 


6240 






GGAGCGAAAG 


CCCCGTCCTG 


TAAACATTTT 


TAAATGATAA 


TTAATAAATT 


TTTGCARCAA 


6300 


m 




GTGTGGAGAA 


AGAAGCCGAG 


CYAAGCTGAC 


ATTCTGATTC 


GRTACGCAAC 


GACAGCTCAA 


6360 


ffi 

m 


25 


TATGTTTCGT 


GGAGAAACAG 


TGCTCGTGGA 


TCATGGTTCA 


TTCAAGCCGT 


CTGTGAAGTG 


6420 




TTCTCGACAC 


ACGYAAAGGA 


TATGGATGTT 


GTTGAGCTGC 


TGACTGAAGT 


CAATAAGAAG 


6480 


r~ 




GTCGYTTGTG 


GATTTCAGAC 


ATCACAGGGA 


TCGAATATTT 


TGAAACAGAT 


GCCARAGGTA 


6540 






CTTGAAACAA 


ACAATGCATG 


TCTAACTTTT 


AAGGACACAG 


AAAAATAGGC 


AGAGGCTCCT 


6600 






TTTGCAAGCC 


TGCCGCGCGT 


CAACCTAGAA 


TTTTAGTTTT 


TAGCTAAAAT 


GATTGATTTT 


6660 




30 


GAATATTTTA 


TGCTAATTTT 


TTTGCGTTAA 


ATTTTGAAAT 


AGTCACTATT 


TATCGGGTTT 


6720 


01 




CCAGTAAAAA 


ATGTTTATTA 


GCCATTGGAT 


TTTACTGAAA 


ACGAAAATTT 


GTAGTTTTTC 


6780 


Ft; 




AACGAAATTT 


ATCGATTTTT 


AAATGTAAAA 


AAAAATAGCG 


AAAATTACAT 


CAACCATCAA 


6840 


: ^ 




GCATTTAAGC 


CAAAATTGTT 


AACTCATTTA 


AAAATTAATT 


CAAAGTTGTC 


CACGAGTATT 


6900 






ACACGGTTGG 


CGCGCGGCAA 


GTTTGCAAAA 


CGACGCTCCG 


CCTCTTTTTC 


TGTGCGGCTT 


6960 




35 


GAAAACAAGG 


GATCGGTTTA 


GATTTTTCCC 


CAAAATTTAA 


ATTAAATTTC 


AGATGACATY 


7020 






CCGCCTGCTC 


AAAAAGTTCT 


ACTTTTGGCC 


GGAAGCACGA 


AACTCTGCCG 


TCTAAAATTC 


7080 






ACTCGTGATT 


CATTGCCCAA 


TTGATAATTG 


TCTGTATCTT 


CTCCCCCAGT 


TCTCTTTCGC 


7140 






CCAATTAGTT 


TAAAACCATG 


TGTATATTGT 


TATCCTATAC 


TCATTTCACT 


TTATCATTCT 


7200 






ATCATTTCTC 


TTCCCATTTT 


CACACATTTC 


CATTTCTCTA 


CGATAATCTA 


AAATTATGAC 


7260 




40 


GTTTGTGTCT 


CGAACGCATA 


ATAATTTTAA 


TAACTCGTTT 


TGAATTTGAT 


TAGTTGTTGT 


7320 






GCCCAGTATA 


TATGTATGTA 


CTATGCTTCT 


ATCAACAAAA 


TAGTTTCATA 


GATCATCACC 


7380 






CCAACCCCAC 


CAACCTACCG 


TACCATATTC 


ATTTTTGCCG 


GGAATCAATT 


TCGATTAATT 


7440 






TTAACCTATT 


TTTTCGCCAC 


AAAAAATCTA 


ATATTTGAAT 


TAACGAATAG 


CATTCCCATC 


7500 






TCTCCCGTGC 


CGGAATGCCT 


CCCGGCCTTT 


TAAAGTTCGG 


AACATTTGGC 


AATTATGTAT 


7560 




45 


AAATTTGTAG 


GTCCCCCCCA 


TCATTTCCCG 


CCCATCATST 


CAAATTGCAT 


TCTTTTTTCG 


7620 






CCGTGATATC 


CCGATTCTGG 


TCAGCAAAGA 


TCT 






7653 



(2) INFORMATION FOR SEQ ID NO:2; 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 03 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
5 (ix) FEATURE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 







Met 


Met 


Arg 


Gin 


Asp 


Arg 


Arg 


Ser 


Leu 


Leu 


Glu 


Arg 


Asn 


He 


Met Met 






1 








5 










10 










15 






Phe 


Ser 


Ser 


His 


Leu 


Lys 


Val 


Asp 


Glu 


He 


Xaa 


Glu 


Val 


Leu 


He Ala 




10 








20 










25 










30 








Lys 


Gin 


Val 
35 


Leu 


Asn 


Ser 


Asp 


Asn 
40 


Gly 


Asp 


Met 


He 


Asn 
45 


Ser 


Cys Gly 






Thr 


Val 
50 


Arg 


Glu 


Lys 


Arg 


Arg 
55 


Glu 


He 


Val 


Lys 


Ala 
60 


Val 


Gin 


Arg Arg 




15 


Xaa 
65 


Asp 


Val 


Ala 


Phe 


Asp 
70 


Ala 


Phe 


Tyr 


Asp 


Ala 
75 


Leu 


Arg 


Ser 


Thr Gly 
80 






His 


Glu 


Gly 


Leu 


Ala 


Glu 


Val 


Leu 


Glu 


Pro 


Leu' 


Ala Arg 


Ser 


Val Asp 














85 










90 










95 


p 




Ser 


Asn 


Ala 


Val 


Glu 


Phe 


Glu 


Cys 


Pro 


Met 


Ser 


Pro 


Ala 


Ser 


His Arg 




20 








100 










105 










110 




03 




Arg 


Ser 


Arg 


Ala 


Leu 


Ser 


Pro 


Ala 


Gly 


Tyr 


Thr 


Ser 


Pro 


Thr 


Arg Val 










115 










120 










125 










His 


Arg 


Asp 


Ser 


Val 


Ser 


Ser 


Val 


Ser 


Ser 


Phe 


Thr 


Ser 


Tyr 


Gin Asp 


ry 






13 0 










135 










140 










25 


He 


Tyr 


Ser 


Arg 


Ala 


Arg 


Ser 


Arg 


Ser 


Arg 


Ser 


Arg Ala 


Leu 


His Ser 


HJ 




145 










150 










155 








160 






Ser 


Asp 


Arg 


His 


Asn 


Tyr 


Ser 


Ser 


Pro 


Pro 


Val 


Asn 


Ala 


Phe 


Pro Ser 














165 










170 










175 






Gin 


Pro 


Ser 


Ser 


Ala 


Asn 


Ser 


Ser 


Phe 


Thr 


Gly 


Cys 


Ser 


Ser 


Leu Gly 




30 








180 










185 










190 








Tyr 


Ser 


Ser 


Ser 


Arg 


Asn 


Arg 


Ser 


Phe 


Ser 


Lys 


Ala 


Ser 


Gly 


Pro Thr 


t y 








195 










200 










2 05 










Gin 


Tyr 


He 


Phe 


His 


Glu 


Glu 


Asp 


Met 


Asn 


Phe 


Val 


Asp 


Ala 


Pro Thr 








210 










215 










220 










35 


He 
225 


Ser 


Arg 


Val 


Phe 


Asp 
230 


Glu 


Lys 


Thr 


Met 


Tyr 
235 


Arg 


Asn 


Phe 


Ser Ser 
240 






Pro 


Arg 


Gly 


Met 


Cys 
245 


Leu 


He 


He 


Asn 


Asn 
250 


Glu 


His 


Phe 


Glu 


Gin Met 
255 






Pro 


Thr 


Arg 


Asn 


Gly 


Thr 


Lys 


Ala 


Asp 


Lys 


Asp 


Asn 


Leu 


Thr 


Asn Leu 




40 








260 










265 










270 








Phe 


Arg 


Cys 
275 


Met 


Gly 


Tyr 


Thr 


Val 
280 


He 


Cys 


Lys 


Asp 


Asn 
285 


Leu 


Thr Gly 






Arg 


Gly 
290 


Met 


Leu 


Leu 


Thr 


He 
295 


Arg 


Asp 


Phe 


Ala 


Lys 
300 


His 


Glu 


Ser His 




45 


Gly 


Asp 


Ser 


Ala 


He 


Leu 


Val 


He 


Leu 


Ser 


His 


Gly Glu 


Glu 


Asn Val 






305 










310 










315 








320 






He 


He 


Gly 


Val 


Asp 


Asp 


He 


Pro 


He 


Ser 


Thr 


His 


Glu 


He 


Tyr Asp 



325 330 335 
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Leu Leu Asn Ala Ala Asn Ala Pro Arg Leu Ala Asn Lys Pro Lys lie 

340 345 350 

Val Phe Val Gin Ala Cys Arg Xaa Glu Arg Arg Asp Asn Gly Phe Pro 
355 360 365 

5 Val Leu Asp Ser Val Asp Gly Val Pro Ala Phe Leu Arg Arg Gly Trp 
370 375 380 

Asp Asn Arg Asp Gly Pro Leu Phe Asn Phe Leu Gly Cys Val Arg Pro 
385 390 395 400 

Gin Val Xaa Gin Val Trp Arg Lys Lys Pro Ser Xaa Ala Asp lie Leu 
10 405 410 415 

lie Arg Tyr Ala Thr Thr Ala Gin Tyr Val Ser Xaa Arg Asn Ser Ala 

420 425 430 

Arg Gly Ser Trp Phe He Gin Ala Val Cys Glu Val Phe Ser Thr His 
435 440 445 

15 Xaa Lys Asp Met Asp Val Val Glu Leu Leu Thr Glu Val Asn Lys Lys 
450 455 460 

Val Xaa Cys Gly Phe Gin Thr Ser Gin Gly Ser Asn He Leu Lys Gin 
465 470 475 480 

Met Pro Xaa Met Thr Xaa Arg Leu Leu Lys Lys Phe Tyr Phe Trp Pro 
20 485 490 495 

Glu Ala Arg Asn Ser Ala Val 
500 

(2) INFORMATION FOR SEQ ID NO: 3: 



yj (i) SEQUENCE CHARACTERISTICS: 

IU 25 (A) LENGTH: 1373 base pairs 

=P (B) TYPE: nucleic acid 

W (C) STRANDEDNESS : single 

s (D) TOPOLOGY: linear 

O 

yl (ii) MOLECULE TYPE : cDNA 

fy 30 (ix) FEATURE: 

\u 

p (A) NAME/ KEY : Coding Sequence 

U (B) LOCATION: 18... 1229 

(D) OTHER INFORMATION: /product^ "human interleukin-1 beta 
convertase" 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

AAAAGGAGAG AAAAGCC ATG GCC GAC AAG GTC CTG AAG GAG AAG AGA AAG 50 

Met Ala Asp Lys Val Leu Lys Glu Lys Arg Lys 
1 5 10 

40 CTG TTT ATC CGT TCC ATG GGT GAA GGT ACA ATA AAT GGC TTA CTG GAT 98 
Leu Phe lie Arg Ser Met Gly Glu Gly Thr lie Asn Gly Leu Leu Asp 
15 20 25 

GAA TTA TTA CAG ACA AGG GTG CTG AAC AAG GAA GAG ATG GAG AAA GTA 146 
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r 







Glu 


Leu 


Leu 
30 


Gin 


Thr 


Arg 


Val 


Leu 
35 


Asn 


Lys 


Glu 


Glu 


Met 
40 


Glu 


Lys 


Val 






5 


AAA 
Lys 


CGT 
Arg 
45 


GAA 
Glu 


AAT 
Asn 


GCT 
Ala 


ACA 
Thr 


GTT 
Val 
50 


ATG 
Met 


GAT 
Asp 


AAG 
Lys 


ACC 
Thr 


CGA 
Arg 
55 


GCT 
Ala 


TTG 
Leu 


ATT 
He 


GAC 
Asp 


194 






TCC 
Ser 
60 


GTT 
Val 


ATT 
He 


CCG 
Pro 


AAA 
Lys 


GGG 
Gly 
65 


GCA 
Ala 


CAG 
Gin 


GCA 
Ala 


TGC 
Cys 


CAA 
Gin 
70 


ATT 
He 


TGC 
Cys 


ATC 
He 


ACA 
Thr 


TAC 
Tyr 
75 


242 




10 


ATT 
He 


TGT 
Cys 


GAA 
Glu 


GAA 
Glu 


GAC 
Asp 
80 


AGT 
Ser 


TAC 
Tyr 


CTG 
Leu 


GCA 
Ala 


GGG 
Gly 
85 


ACG 
Thr 


CTG 
Leu 


GGA 
Gly 


CTC 
Leu 


TCA 
Ser 
90 


GCA 
Ala 


290 






GAT 
Asp 


CAA 
Gin 


ACA 
Thr 


TCT 
Ser 
95 


GGA 
Gly 


AAT 
Asn 


TAC 
Tyr 


CTT 
Leu 


AAT 
Asn 
100 


ATG 
Met 


CAA 
Gin 


GAC 
Asp 


TCT 
Ser 


CAA 
Gin 
105 


GGA 
Gly 


GTA 
Val 


338 




15 


CTT 
Leu 


TCT 
Ser 


TCC 
Ser 
110 


TTT 
Phe 


CCA 
Pro 


GCT 
Ala 


CCT 
Pro 


CAG 
Gin 

115 


GCA 
Ala 


GTG 
Val 


CAG 
Gin 


GAC 
Asp 


AAC 
Asn 
120 


CCA 
Pro 


GCT 
Ala 


ATG 
Met 


386 




20 


CCC 
Pro 


ACA 
Thr 
125 


TCC 
Ser 


TCA 
Ser 


GGC 
Gly 


TCA 
Ser 


GAA 
Glu 
130 


GGG 
Gly 


AAT 
Asn 


GTC 
Val 


AAG 
Lys 


CTT 
Leu 
135 


TGC 
Cys 


TCC 
Ser 


CTA 
Leu 


GAA 
Glu 


434 


SIS 

5 




GAA 
Glu 
140 


GCT 
Ala 


CAA 
Gin 


AGG 
Arg 


ATA 
He 


TGG 
Trp 
145 


AAA 
Lys 


CAA 
Gin 


AAG 
Lys 


TCG 
Ser 


GCA 
Ala 
150 


GAG 
Glu 


ATT 
He 


TAT 
Tyr 


CCA 
Pro 


ATA 
He 
155 


482 




25 


ATG 
Met 


GAC 
Asp 


AAG 
Lys 


TCA 
Ser 


AGC 
Ser 
160 


CGC 
Arg 


ACA 
Thr 


CGT 
Arg 


CTT 
Leu 


GCT 
Ala 
165 


CTC 
Leu 


ATT 
He 


ATC 
He 


TGC 
Cys 


AAT 
Asn 
170 


GAA 
Glu 


530 






GAA 
Glu 


TTT 
Phe 


GAC 
Asp 


AGT 
Ser 
175 


ATT 
He 


CCT 
Pro 


AGA 
Arg 


AGA 
Arg 


ACT 
Thr 
180 


GGA 
Gly 


GCT 
Ala 


GAG 
Glu 


GTT 
Val 


GAC 
Asp 
185 


ATC 
He 


ACA 
Thr 


578 




30 


GGC 
Gly 


ATG 
Met 


ACA 
Thr 
190 


ATG 
Met 


CTG 
Leu 


CTA 
Leu 


CAA 
Gin 


AAT 
Asn 
195 


CTG 
Leu 


GGG 
Gly 


TAC 
Tyr 


AGC 
Ser 


GTA 
Val 
200 


GAT 
Asp 


GTG 
Val 


AAA 
Lys 


626 




35 


AAA 
Lys 


AAT 
Asn 
205 


CTC 
Leu 


ACT 
Thr 


GCT 
Ala 


TCG 
Ser 


GAC 
Asp 
210 


ATG 
Met 


ACT 
Thr 


ACA 
Thr 


GAG 
Glu 


CTG 
Leu 
215 


GAG 
Glu 


GCA 
Ala 


TTT 
Phe 


GCA 
Ala 


674 






CAC 
His 
220 


CGC 
Arg 


CCA 
Pro 


GAG 
Glu 


CAC 
His 


AAG 
Lys 
225 


ACC 
Thr 


TCT 
Ser 


GAC 
Asp 


AGC 
Ser 


ACG 
Thr 
230 


TTC 
Phe 


CTG 
Leu 


GTG 
Val 


TTC 
Phe 


ATG 
Met 
235 


722 
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r 



TCT CAT GGT ATT CGG GAA GGC ATT TGT GGG AAG AAA CAC TCT GAG CAA 770 
Ser His Gly lie Arg Glu Gly He Cys Gly Lys Lys His Ser Glu Gin 
240 245 250 

GTC CCA GAT ATA CTA CAA CTC AAT GCA ATC TTT AAC ATG TTG AAT ACC 818 
5 Val Pro Asp He Leu Gin Leu Asn Ala He Phe Asn Met Leu Asn Thr 
255 260 265 

AAG AAC TGC CCA AGT TTG AAG GAC AAA CCG AAG GTG ATC ATC ATC CAG 866 
Lys Asn Cys Pro Ser Leu Lys Asp Lys Pro Lys Val He He He Gin 
270 275 280 

10 GCC TGC CGT GGT GAC AGC CCT GGT GTG GTG TGG TTT AAA GAT TCA GTA 914 
Ala Cys Arg Gly Asp Ser Pro Gly Val Val Trp Phe Lys Asp Ser Val 
285 290 295 

GGA GTT TCT GGA AAC CTA TCT TTA CCA ACT ACA GAA GAG TTT GAG GAT 962 
Gly Val Ser Gly Asn Leu Ser Leu Pro Thr Thr Glu Glu Phe Glu Asp 
15 300 305 310 315 

GAT GCT ATT AAG AAA GCC CAC ATA GAG AAG GAT TTT ATC GCT TTC TGC 1010 
D Asp Ala He Lys Lys Ala His He Glu Lys Asp Phe He Ala Phe Cys 

Ul 320 325 330 

ffl TCT TCC ACA CCA GAT AAT GTT TCT TGG AGA CAT CCC ACA ATG GGC TCT 1058 

EQ 2 0 Ser Ser Thr Pro Asp Asn Val Ser Trp Arg His Pro Thr Met Gly Ser 

335 340 345 



GTT TTT ATT GGA AGA CTC ATT GAA CAT ATG CAA GAA TAT GCC TGT TCC 1106 
Val Phe He Gly Arg Leu He Glu His Met Gin Glu Tyr Ala Cys Ser 
350 355 360 



ill 25 TGT GAT GTG GAG GAA ATT TTC CGC AAG GTT CGA TTT TCA TTT GAG CAG 1154 
fu Cys Asp Val Glu Glu He Phe Arg Lys Val Arg Phe Ser Phe Glu Gin 

^ 365 370 375 

CCA GAT GGT AGA GCG CAG ATG CCC ACC ACT GAA AGA GTG ACT TTG ACA 12 02 
Pro Asp Gly Arg Ala Gin Met Pro Thr Thr Glu Arg Val Thr Leu Thr 
30 380 385 390 395 

AGA TGT TTC TAG CTC TTC CCA GGA CAT TAAAATAAGG AAACTGTATG AATGTCT 1256 
Arg Cys Phe Tyr Leu Phe Pro Gly His 
400 

GCGGGCAGGA AGTGAAGAGA TCGTTCTGTA AAAGGTTTTT GGAATTATGT CTGCTGAATA 1316 
35 ATAAACTTTT TTTGAAATAA TAAATCTGGT AGAAAAATGA AAAAAAAAAA AAAAAAA 1373 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 404 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
5 (ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ala Asp Lys Val Leu Lys Glu Lys Arg Lys Leu Phe lie Arg Ser 

15 10 15 

Met Gly Glu Gly Thr lie Asn Gly Leu Xaa Asp Glu Leu Leu Gin Thr 
10 20 25 30 

Arg Val Leu Asn Lys Glu Glu Met Glu Lys Val Lys Arg Glu Asn Ala 

35 40 45 

Thr Val Met Asp Lys Thr Arg Ala Leu lie Asp Ser Val lie Pro Lys 
50 55 60 

15 Xaa Ala Gin Ala Cys Gin He Cys He Thr Tyr He Cys Glu Glu Asp 
65 70 75 80 

Ser Tyr Leu Ala Gly Thr Leu Gly Leu Ser Ala Asp Gin Thr Ser Gly 
_ 85 90 95 

Q Asn Tyr Leu Asn Met Gin Asp Ser Gin Gly Val Leu Ser Ser Phe Pro 

S 20 100 105 110 

CQ Ala Pro Gin Ala Val Gin Asp Asn Pro Ala Met Pro Thr Ser Ser Gly 

CO 115 120 125 

S3 Ser Glu Gly Asn Val Lys Leu Cys Ser Leu Glu Glu Ala Gin Arg He 

nj 130 135 140 

=P 25 Trp Lys Gin Lys Ser Ala Glu He Tyr Pro He Met Asp Lys Ser Ser 
Uj 145 150 155 160 

s Arg Thr Arg Leu Ala Leu He He Cys Asn Glu Glu Phe Asp Ser He 

p 165 170 175 

U\ Pro Arg Arg Thr Gly Ala Glu Val Asp He Thr Gly Met Thr Met Leu 

Aj 30 180 185 190 

n= Leu Gin Asn Leu Gly Tyr Ser Val Asp Val Lys Lys Asn Leu Thr Ala 

j-5 195 200 205 

y[ Ser Asp Met Thr Thr Glu Leu Glu Ala Phe Ala His Arg Pro Glu His 

210 215 220 

35 Lys Thr Ser Asp Ser Thr Phe Leu Val Phe Met Ser His Gly He Arg 
225 230 235 240 

Glu Gly He Cys Gly Lys Lys His Ser Glu Gin Val Pro Asp He Leu 

245 250 255 

Gin Leu Asn Ala He Phe Asn Met Leu Asn Thr Lys Asn Cys Pro Ser 
40 260 265 270 

Leu Lys Asp Lys Pro Lys Val He He He Gin Ala Xaa Arg Xaa Asp 

275 280 * 285 

Ser Pro Gly Val Val Trp Phe Lys Asp Ser Val Gly Val Ser Gly Asn 
290 295 300 

45 Leu Ser Leu Pro Thr Thr Glu Glu Phe Glu Asp Asp Ala He Lys Lys 
305 310 315 320 

Ala His He Xaa Lys Asp Phe He Ala Phe Cys Ser Ser Thr Pro Asp 
325 330 335 
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Asn Val Ser Xaa Arg His Pro Thr Met Gly Ser Val Phe He Gly Arg 

340 345 350 

Leu He Glu His Met Gin Glu Tyr Xaa Cys Ser Cys Asp Val Glu Glu 

355 360 365 

He Phe Arg Lys Val Arg Phe Ser Phe Glu Gin Pro Asp Gly Arg Ala 

370 375 380 

Gin Met Pro Thr Thr Xaa Arg Val Xaa Leu Thr Arg Cys Phe Tyr Leu 
385 390 395 400 

Phe Pro Gly His 



(2) INFORMATION FOR SEQ ID NO:5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

O (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

W Met Met Arg Gin Asp Arg Trp Leu Leu Glu Arg Asn He Leu Glu Phe 

W 20 1 5 10 15 

£y Ser Ser Lys Leu Gin Ala Asp Leu He Leu Asp Val Leu He Ala Lys 

fU 20 25 30 

JS Gin Val Leu Asn Ser Asp Asn Gly Asp Val He Asn Ser Cys Arg Thr 

UJ 35 40 45 

= 25 Glu Arg Asp Asn Glu Lys Glu He Val Lys Ala Val Gin Arg Arg Gly 

O 50 55 60 

fll Asp Glu Ala Phe Asp Ala Phe Tyr Asp Ala Leu Arg Asp Thr Gly His 

fij 65 70 75 80 

fll Asn Asp Leu Ala Asp Val Leu Met Pro Leu Ser Arg Pro Xaa Xaa Xaa 

g 30 85 90 95 

y. Asn Pro Val Pro Met Glu Cys Pro Met Ser Pro Ser Ser His Arg Arg 

100 105 HO 

Ser Arg Ala Leu Ser Pro Pro Xaa Tyr Ala Ser Pro Thr Arg Val His 
115 120 125 

35 Arg Asp Ser He Ser Ser Val Ser Ser Phe Thr Ser Thr Tyr Gin Asp 
130 135 140 

Val Tyr Ser Arg Ala Arg Ser Ser Ser Arg Ser Ser Arg Pro Leu Gin 
145 150 155 160 

Ser Ser Asp Arg His Asn Tyr Met Ser Ala Ala Thr Ser Phe Pro Ser 
40 165 170 * 175 

Gin Pro Xaa Ser Ala Asn Ser Ser Phe Thr Gly Cys Ala Ser Leu Gly 

180 185 190 

Tyr Ser Ser Ser Arg Asn Arg Ser Phe Ser Lys Thr Ser Ala Gin Ser 
195 200 205 

45 Gin Tyr He Phe His Glu Glu Asp Met Asn Tyr Val Asp Ala Pro Thr 
210 215 220 
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He His Arg Val Phe Asp Glu Lys Thr Met Tyr Arg Asn Phe Ser Ser 
225 230 235 240 

Pro Arg Gly Leu Cys Leu He lie Asn Asn Glu His Phe Glu Gin Met 
245 250 255 

5 Pro Thr Arg Asn Gly Thr Lys Ala Asp Lys Asp Asn Leu Thr Asn He 
260 265 270 

Phe Arg Cys Met Gly Tyr Thr Val He Cys Lys Asp Asn Leu Thr Gly 

275 280 285 

Arg Glu Met Leu Ser Thr He Arg Ser Phe Gly Arg Asn Asp Met His 
10 290 295 300 

Gly Asp Ser Ala He Leu Val He Leu Ser His Gly Glu Xaa Asn Val 
305 310 315 320 

He He Gly Val Asp Asp Val Ser Val Asn Val His Glu He Tyr Asp 
325 330 335 

15 Leu Leu Asn Ala Ala Asn Ala Pro Arg Leu Ala Asn Lys Pro Lys Leu 

340 345 350 

Val Phe Val Gin Ala Cys Arg Gly Glu Arg Arg Asp Asn Gly Phe Pro 

355 360 365 

Val Leu Asp Ser Val Asp Gly Val Pro Ser Leu He Arg Arg Gly Trp 
20 370 375 380 

Asp Asn Arg Asp Gly Pro Leu Phe Asn Phe Leu Gly Cys Val Arg Pro 
p 385 390 395 400 

Jj Gin Val Gin Gin Val Trp Arg Lys Lys Pro Ser Gin Ala Asp Met Leu 

-fB 405 410 415 

£n 25 He Ala Tyr Ala Thr Thr Ala Gin Tyr Val Ser Trp Arg Asn Ser Ala 
ftj 420 425 430 

ry Arg Gly Ser Trp Phe He Gin Ala Val Cys Glu Val Phe Ser Leu His 

J* 435 440 445 

yj Ala Lys Asp Met Asp Val Val Glu Leu Leu Thr Glu Val Asn Lys Lys 

30 450 455 460 

Val Ala Cys Gly Phe Gin Thr Ser Gin Gly Ser Asn He Leu Lys Gin 
S 465 470 475 480 

Z\ Met Pro Glu Leu Thr Ser Arg Leu Leu Lys Lys Phe Tyr Phe Trp Pro 

485 490 495 

~ 35 Glu Asp Arg Gly Arg Asn Ser Ala Val 

H 500 505 

Ms 

(2) INFORMATION FOR SEQ ID NO : 6 ; 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 479 amino acids 

40 <B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

45 Thr Val Ser Leu Ser Leu He He Ala Arg Gin Val Leu Asn Ser Asp 
15 10 15 
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10 



15 



20 



25 



30 



If! 

s V 



35 



40 



45 



50 



Asn Gly Asp Met He Asn Ser Cys Arg Thr Glu Arg Asp Asn Glu Lys 

25 

Glu lie Val Lys Ala Val Gin Arg Arg Gly Asp Glu Ala Phe Asp Ala 

Phe Tyr Asp Ala Leu Arg Asp Thr Gly His Asn Asp Leu Ala Asp Val 

Leu Met Pro Leu Ser Arg Pro Val Asp Ser Asn Pro Val Pro Met Glu 

Cys Pro Met Ser Pro Ser Ser His Arg Arg Ser Arg Ala Leu Ser Pro 

Pro Xaa Tyr Ala Ser Pro Thr Arg Val His Arg Asp Ser lie Ser Ser 

100 105 
Val Ser Ser Phe Thr Ser Thr Tyr Gin Asp Val Tyr Ser Arg Ala Thr 

115 120 
Ser Ser Ser Pro Leu Gin Thr Ser Asp Arg His Asn Tyr Val Ser Ala 

Ser Tnr Ser Phe Gin Ser Gin Pro Ala Ser Ala Asn Ser Ser Phe Thr 

150 155 
III Ser Ala Ser Leu Gly Tyr Ser Ser Ser Arg Thr Arg Ser Tyr Ser 

165 170 
Lys Thr Ser Ala His Ser Gin Tyr lie Phe His Glu Glu Asp Met Asn 



180 185 



| Tyr Val Asp Ala Pro Thr lie His Arg Val Phe Asp Glu Lys Thr Met 



195 

Tyr Arg Asn Phe Ser Thr Pro Arg Gly Leu Cys Leu lie lie Asn Asn 



200 



9-ic 220 
210 21b 



Glu His Phe Glu Gin Met Pro Thr Arg Asn Gly Thr Lys Pro Asp Lys 



230 235 240 

Asp Asn lie Ser Asn lie Phe Arg Cys Met Gly Tyr lie Val Hi. Cys 

245 250 



Lys Asp Asn Leu Thr Gly Arg Glu Met Met Ser Thr lie Arg Ser Phe 



260 



265 



Gly Arg Asn Asp Thr His Gly Asp Ser Ala lie Leu Val He Leu Ser 

nc 280 285 

His Gly Glu Xaa Asn Val He He Gly Val Asp Asp Val Ser Val Asn 



Val hIs Glu lie Tyr Xaa Leu Leu Asn Ala Ala Asn Ala Pro Arg Leu 

HI Asn Lys Pro Lys Leu Val Phe Val Gin III Cys Arg Gly Glu Arg 
325 330 



295 



Arg Asp Val Gly Phe Pro Val Leu Asp Ser Val Asp Gly Val Pro Ser 

Leu lie Arg Arg Gly Trp Asp Lys a" Asp Gly Pro Leu Phe Asn Phe 

Leu Gly Cys Val Arg Pro Gin Ala Gin Gin Val Trp Arg Lys Lys Pro 

S er IZ Ala Asp Met Leu lie Ala Tyr Ala Thr Thr Ala Gin Tyr Val 

III Trp Arg Asn Ser Ala Arg Gly Ser Trp Phe He Gin Ala Val Cys 

405 410 
Glu Val Phe Ser Leu His Ala Lys Asp Met Asp Val Val Glu Leu Leu 
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420 425 430 

Thr Glu Val Asn Lys Lys Val Ala Cys Gly Phe Gin Thr Ser Gin Gly 

435 440 445 

Ala Asn He Leu Lys Gin Met Pro Glu Leu Thr Ser Arg Leu Leu Lys 
5 450 455 460 

Lys Phe Tyr Phe Trp Pro Glu Asp Arg Asn Arg Ser Ser Ala Val 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
f*| 15 TCATCGACTT TTAGATGACT AGAGAACATC 

S (i) SEQUENCE CHARACTERISTICS: 

S (A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

+: 2 0 (C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 
s 

•fees? 

J (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

[U GTTGCACTGC TTTCACGATC TCCCGTCTCT 

M= (2) INFORMATION FOR SEQ ID NO : 9 : 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GTTTAATTAC CCAAGTTTGA G 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 



30 



30 



21 
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(B) 
(C) 
(D) 



TYPE: nucleic acid 
STRAND EDNESS : s ingle 
TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGTTTTAACC AGTTACTCAA G 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCGGTGACAT TGGACACTC 

(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ACTATTCAAC ACTTG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa at position 117 is Ala or Val . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Leu Thr Val Gin Val Tyr Arg Thr Ser Gin Lys Cys Ser Ser Ser 
15 10 15 
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Lys His Val Val Glu Val Leu Leu Asp Pro Leu Gly Thr Ser Phe Cys 

20 25 30 

Ser Leu Leu Pro Pro Pro Leu Leu Leu Tyr Glu Thr Asp Arg Gly Val 
35 40 45 

5 Asp Gin Gin Asp Gly Lys Asn His Thr Gin Ser Pro Gly Cys Glu Glu 
50 55 60 

Ser Asp Ala Gly Lys Glu Glu Leu Met Lys Met Arg Leu Pro Thr Arg 
65 70 75 80 

Ser Asp Met He Cys Gly Tyr Ala Cys Leu Lys Gly Asn Ala Ala Met 
10 85 90 95 

Arg Asn Thr Lys Arg Gly Ser Trp Tyr He Glu Ala Leu Thr Gin Val 

100 105 110 

Phe Ser Glu Arg Xaa Cys Asp Met His Val Ala Asp Met Leu Val Lys 
115 120 125 

15 Val Asn Ala Leu He Lys Glu Arg Glu Gly Tyr Ala Pro Gly Thr Glu 
130 135 140 

Phe His Arg Cys Lys Glu Met Ser Glu Tyr Cys Ser Thr Leu Cys Gin 
145 150 155 160 

Gin Leu Tyr Leu Phe Pro Gly Tyr Pro Pro Thr 
20 165 170 

*J (2) INFORMATION FOR 3SQ ID NO: 14: 

5f (i) SEQUENCE CHARACTERISTICS: 

~* (A) LENGTH: 402 amino acids 

J*j (B) TYPE: amino acid 

^ 25 (C) STRANDEDNESS : single 

=F (D) TOPOLOGY: linear 

as ' 

3 (ii) MOLECULE TYPE: protein 

0^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

fU Met Ala Asp Lys lie Leu Arg Ala Lys Arg Lys Gin Phe lie Asn Ser 

O 30 1 5 10 15 

y= val Ser He Gly Thr He Asn Gly Leu Leu Asp Glu Leu Leu Glu Lys 

20 25 30 

Arg Val Leu Asn Gin Glu Glu Met Asp Lys He Lys Leu Ala Asn He 
35 40 45 

35 Thr Ala Met Asp Lys Ala Arg Asp Leu Cys Asp His Val Ser Lys Lys 
50 55 60 

Gly Pro Gin Ala Ser Gin He Phe He Thr Tyr He Cys Asn Glu Asp 
65 70 75 80 

Cys Tyr Leu Ala Gly He Leu Glu Leu Gin Ser Ala Pro Ser Ala Glu 
40 85 90 - 95 

Thr Phe Val Ala Thr Glu Asp Ser Lys Gly Gly His Pro Ser Ser Ser 

100 105 110 

Glu Thr Lys Glu Glu Gin Asn Lys Glu Asp Gly Thr Phe Pro Gly Leu 
115 120 125 

45 Thr Gly Thr Leu Lys Phe Cys Pro Leu Glu Lys Ala Gin Lys Leu Trp 
130 135 140 
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Lys Glu Asn 
145 

Thr Arg Leu 

Pro Arg Val 

Glu Asp Leu 
195 

Glu Met Val 

210 
Thr Ser Asp 
225 

Gly lie Cys 

Val Asp Thr 

Lys Asp Lys 
275 

Gin Gly Val 

290 
Phe Leu Thr 
305 

He Glu Lys 

Ser Trp Arg 

Lys His Met 
355 

Arg Lys Val 
370 

Pro Thr Ala 
385 

Gly His 



Pro Ser 

Ala Leu 
165 
Gly Ala 
180 

Gly Tyr 

Lys Glu 

Ser Thr 

Gly Thr 
245 
He Phe 
260 

Pro Lys 

Val Leu 

Asp Ala 

Asp Phe 
325 
His Pro 
340 

Lys Glu 



Glu He 
150 

He He 

Gin Val 

Thr Val 

Val Lys 
215 
Phe Leu 
230 

Thr Tyr 

Gin Met 

Val He 

Leu Lys 
295 
He Phe 
310 

He Ala 
Val Arg 
Tyr Ala 



Tyr Pro 
Cys Asn 



He Met Asn 
155 

Thr Glu Phe 
170 

Arg Glu Met 



Asp Leu 
185 

Lys Val Lys Glu Asn 
200 

Glu Phe 



Val Phe 
Ser Asn 



Ala Ala Cys 
220 

Met Ser His 

235 
Glu Val Ser 
250 

Thr Leu Lys 



Met Asn 
265 

He He Gin Ala Cys 
280 

Asp Ser 



Arg Phe 
Asp Arg 



Ser Phe 
375 
Val Thr 
390 



Glu Asp 

Phe Cys 

Gly Ser 
345 
Trp Ser 
360 

Glu Gin 
Leu Thr 



Val Arg Asp 
300 

Asp Gly He 
315 

Ser Ser Thr 
330 

Leu Phe He 
Cys Asp Leu 



Pro Glu Phe 
380 

Lys Arg Phe 
395 



Thr Thr 

Gin His 

Lys Leu 
190 
Leu Thr 
205 

Pro Glu 

Gly He 

Asp He 

Cys Pro 
270 
Arg Gly 
285 

Ser Glu 

Lys Lys 

Pro Asp 

Glu Ser 
350 
Glu Asp 
365 

Arg Leu 



Thr Arg 
160 
Leu Ser 
175 

Leu Leu 

Ala Leu 

His Lys 

Gin Glu 
240 
Leu Lys 
255 

Ser Leu 

Glu Lys 

Glu Asp 

Ala His 
320 
Asn Val 
335 

Leu He 
He Phe 
Gin Met 



Tyr Leu 



Phe Pro 
400 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa at position 3 is Ala, 

Lys, Phe, Cha or Asp. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Tyr Val Xaa Asp 
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05 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 

5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

10 (D) OTHER INFORMATION: Peptide has an acetyl group at the 

amino terminus and an aldehyde group at the carboxyl terminus. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16; 

Tyr Val Ala Asp 
1 

^15 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
J (B) TYPE: amino acid 

jj (C) STRANDEDNESS: single 

| 20 (D) TOPOLOGY: linear 

J (ii) MOLECULE TYPE: peptide 

3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

J Tyr Val Ala Asp 

U i 

& 25 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa at position 2 is D-Ala. Peptide 
has an acetyl group at the amino terminus and an aldehyde group at 

35 the carboxyl terminus. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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Tyr xaa Ala Asp 
1 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(D) OTHER INFORMATION: Peptide has an acetyl group at the 
amino terminus and an aldehyde group at the carboxyl terminus. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Tyr Val Lys Asp 
1 

(2) INFORMATION FOR SEQ ID NO: 20; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: C- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Arg Ser Arg Ser Arg Ser Arg Ala Leu His Ser Ser Asp Arg His Asn 

15 10 15 

Tyr Ser Ser Pro Pro Val Asn Ala Phe Pro Ser Gin Pro Ser Ser Ala 

20 25 30 

Asn Ser Ser Phe Thr Gly Cys Ser Ser Leu Gly Tyr Ser Ser Ser Arg 

35 40 45 

Asn Arg Ser Phe Ser Lys Ala Ser Gly Pro Thr Gin Tyr He Phe His 

50 55 60 

Glu Glu Asp Met Asn Phe Val Asp Ala Pro Thr He Ser Arg Val Phe 
65 70 75 80 

Asp Glu Lys Thr Met Tyr Arg Asn Phe Ser Ser Pro Arg Gly Met Cys 

85 90 95 

Leu He He Asn Asn Glu His Phe Glu Gin Met Pro Thr Arg Asn Gly 

100 105 110 

Thr Lys Ala Asp Lys Asp Asn Leu Thr Asn Leu Phe Arg Cys Met Gly 

115 . 120 125 

Tyr Thr Val He Cys Lys Asp Asn Leu Thr Gly Arg Gly Met Leu Leu 
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140 

U°e U9 -P - - E »U .1. ~ 01V «p ser Ma £ 
£ v al ne « S°. « y «» «« » vai X- - °* ~ 
MP ne Pro E -r His 01. ne Tyr -p - ~ » - - 

180 x D rn t.va He Val Phe Val Gin Ala 

A sn Ala Pro Arg Leu Ala Asn Lys Pro Lys 

195 . pi„ p he Pro Val Leu Asp Ser Val 

Cys Arg xaa Glu Arg Arg Asp Asn Gly Phe ^ 

Asp III val Pro Ala Phe Leu Arg Arg Gly Trp «P - Arg Asp Gly 
HI Le u Phe Asn Phe Z Gly Cys Val Arg Pro Gin Val Xaa Gin Val 

245 c y aa Ala Asp He Leu He Arg Tyr Ala Thr 
Trp Arg Lys Lys Pro Ser Xaa Ala Asp ^ 

260 Ala Ara Glv Ser Trp Phe 

Thr Ala Gin Tyr Val Ser Xaa Arg Asn Ser Ala Arg Gly 

275 m , Yaa lvs Asp Met Asp 

Ile Gin Ala Val Cys Glu Val Phe Ser Thr Hxs Xaa Lys 

295 „ ^ r _ rii \r Phe 

" W pin \7al ASTl LVS LyS Vax Ae*& — j 

Val Val Glu Leu Leu Thr Glu Val Asn Lys ^ ^ 
Z Thr Ser Gin Gly Ser Asn He Leu Lys Gin Me, Pro Xaa Me, Thr 
Xaa Arg Leu Leu Z - * Phe Trp Pro Glu Ala Arg Asn Ser 



340 345 



Ala Val 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: C- terminal 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

val „ «y v* - - - - ™ ™ ? ly TIP MP M ° » ** 

^ Pro ,eu P„e » PH= ,eu BLy Cys V,! «, « «» -1 xaa «. 

val TrP « S. « ~ T - - - ~ - " 9 ^ 

Thr Th r JL ^ ^ ser X., « g ». se, £a «. «V ~ *P 



-117- 



4 



Phe lie Gin Ala Val Cys Glu Val Phe Ser Thr His Xaa Lys Asp Met 
65 70 75 80 

Asp Val Val Glu Leu Leu Thr Glu Val Asn Lys Lys Val Xaa Cys Gly 
85 90 95 

5 Phe Gin Thr Ser Gin Gly Ser Asn He Leu Lys Gin Met Pro Xaa Met 
100 105 110 

Thr Xaa Arg Leu Leu Lys Lys Phe Tyr Phe Trp Pro Glu Ala Arg Asn 

115 120 125 

Ser Ala Val 
10 130 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
^ {v} FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

zt 2 0 Arg Ser Arg Ser Arg Ser Arg Ala Leu His Ser Ser Asp Arg His Asn 

■j! 1 5 10 15 

iU Tyr Ser Ser Pro Pro Val Asn Ala Phe Pro Ser Gin Pro Ser Ser Ala 

£ 20 25 30 

W Asn Ser Ser Phe Thr Gly Cys Ser Ser Leu Gly Tyr Ser Ser Ser Arg 

^ 25 35 40 45 

P Asn Arg Ser Phe Ser Lys Ala Ser Gly Pro Thr Gin Tyr He Phe His 

yt 50 55 60 

fy Glu Glu Asp Met Asn Phe Val Asp Ala Pro Thr He Ser Arg Val Phe 

Fy 65 70 75 80 

S 30 Asp Glu Lys Thr Met Tyr Arg Asn Phe Ser Ser Pro Arg Gly Met Cys 
H> 85 90 95 

Leu He He Asn Asn Glu His Phe Glu Gin Met Pro Thr Arg Asn Gly 

100 105 110 

Thr Lys Ala Asp Lys Asp Asn Leu Thr Asn Leu Phe Arg Cys Met Gly 
35 115 120 125 

Tyr Thr Val He Cys Lys Asp Asn Leu Thr Gly Arg Gly Met Leu Leu 

130 135 140 

Thr He Arg Asp Phe Ala Lys His Glu Ser His Gly Asp Ser Ala He 
145 150 155 160 

40 Leu Val He Leu Ser His Gly Glu Glu Asn Val He He Gly Val Asp 

165 170 175 

Asp He Pro He Ser Thr His Glu He Tyr Asp Leu Leu Asn Ala Ala 

180 185 190 

Asn Ala Pro Arg Leu Ala Asn Lys Pro Lys He Val Phe Val Gin Ala 
45 195 . 200 205 

Cys Arg Xaa Glu Arg Arg Asp Asn Gly Phe Pro Val Leu Asp Ser 
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210 215 220 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 294 amino acids 

5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Phe Pro Ala Pro Gin Ala Val Gin Asp Asn Pro Ala Met Pro Thr Ser 

15 10 15 

Ser Gly Ser Glu Gly Asn Val Lys Leu Cys Ser Leu Glu Glu Ala Gin 
20 25 30 

15 Arg lie Trp Lys Gin Lys Ser Ala Glu lie Tyr Pro lie Met Asp Lys 
35 40 45 

. Ser Ser Arg Thr Arg Leu Ala Leu lie lie Cys Atm Glu Glu Phe Asp 
50 55 60 

2! Ser lie Pro Arg Arg Thr Gly Ala Glu Val Asp lie Thr Gly Met Thr 

~ 20 65 70 75 80 

2^ Met Leu Leu Gin Asn Leu Gly Tyr Ser Val Asp Val Lys Lys Asn Leu 

*jf 85 90 95 

^ Thr Ala Ser Asp Met Thr Thr Glu Leu Glu Ala Phe Ala His Arg Pro 

W 100 105 110 

^ 25 Glu His Lys Thr Ser Asp Ser Thr Phe Leu Val Phe Met Ser His Gly 
y 115 120 125 

JJ; He Arg Glu Gly He Cys Gly Lys Lys His Ser Glu Gin Val Pro Asp 

130 135 140 

He Leu Gin Leu Asn Ala He Phe Asn Met Leu Asn Thr Lys Asn Cys 
30 145 150 155 160 

Pro Ser Leu Lys Asp Lys Pro Lys Val lie lie lie Gin Ala Xaa Arg 

165 170 175 

Xaa Asp Ser Pro Gly Val Val Trp Phe Lys Asp Ser Val Gly Val Ser 
180 185 190 

35 Gly Asn Leu Ser Leu Pro Thr Thr Glu Glu Phe Glu Asp Asp Ala lie 
195 200 205 

Lys Lys Ala His lie Xaa Lys Asp Phe lie Ala Phe Cys Ser Ser Thr 

210 215 220 

Pro Asp Asn Val Ser Xaa Arg His Pro Thr Met Gly Ser Val Phe He 
40 225 230 235 240 

Gly Arg Leu lie Glu His Met Gin Glu Tyr Xaa Cys Ser Cys Asp Val 

245 250 255 

Glu Glu He Phe Arg Lys Val Arg Phe Ser Phe Glu Gin Pro Asp Gly 
260 265 270 

45 Arg Ala Gin Met Pro Thr Thr Xaa Arg Val Xaa Leu Thr Arg Cys Phe 
275 280 285 



i \ : 
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Tyr Leu Phe Pro Gly His 
290 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
10 (v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 







Ser Val 


Gly 


Val 


Ser 


Gly 


Asn 


Leu 


Ser 


Leu 


Pro 


Thr 


Thr 


Glu 


Glu 


Phe 






1 




5 










10 










15 








Glu Asp 


Asp 


Ala 


He 


Lys 


Lys 


Ala 


His 


He 


Xaa 


Lys 


Asp 


Phe 


He 


Ala 




15 






20 










25 










30 










Phe Cys 


Ser 


Ser 


Thr 


Pro 


Asp 


Asn 


Val 


Ser 


Xaa 


Arg 


His 


Pro 


Thr 


Met 








35 










40 










45 












Gly Ser 


Val 


Phe 


He 


Gly 


Arg 


Leu 


He 


Glu 


His 


Met 


Gin 


Glu 


Tyr 


Xaa 


03 




50 










55 










60 












20 


Cys Ser 


Cys 


Asp 


Val 


Glu 


Glu 


He 


Phe 


Arg 


Lys 


Val 


Arg 


Phe 


Ser 


Phe 


tt 




65 








70 










75 










80 


3 %? 




Glu Gin 


Pro 


Asp 


Gly 


Arg 


Ala 


Gin 


Met 


Pro 


Thr 


Thr 


Xaa 


Arg 


Val 


Xaa 












85 










90 










95 








Leu Thr 


Arg 


Cys 


Phe 


Tyr 


Leu 


Phe 


Pro 


Gly His 














25 






100 










105 

















n 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

35 Phe Pro Ala Pro Gin Ala Val Gin Asp Asn Pro Ala Met Pro Thr Ser 
15 10 15 

Ser Gly Ser Glu Gly Asn Val Lys Leu Cys Ser Leu Glu Glu Ala Gin 

20 25 30 

Arg He Trp Lys Gin Lys Ser Ala Glu He Tyr Pro He Met Asp Lys 
40 35 40 45 

Ser Ser Arg Thr Arg Leu Ala Leu He He Cys Asn Glu Glu Phe Asp 
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50 55 60 

Ser He Pro Arg Arg Thr Gly Ala Glu Val Asp He Thr Gly Met Thr 
65 70 75 80 

Met Leu Leu Gin Asn Leu Gly Tyr Ser Val Asp Val Lys Lys Asn Leu 
5 85 90 95 

Thr Ala Ser Asp Met Thr Thr Glu Leu Glu Ala Phe Ala His Arg Pro 

100 105 110 

Glu His Lys Thr Ser Asp Ser Thr Phe Leu Val Phe Met Ser His Gly 
115 120 125 

10 He Arg Glu Gly He Cys Gly Lys Lys His Ser Glu Gin Val Pro Asp 
130 135 140 

He Leu Gin Leu Asn Ala He Phe Asn Met Leu Asn Thr Lys Asn Cys 
145 150 155 160 

Pro Ser Leu Lys Asp Lys Pro Lys Val He He He Gin Ala Xaa Arg 
15 165 170 175 

Xaa Asp Ser Pro Gly Val Val Trp Phe Lys Asp 
180 185 



(2) INFORMATION FOR SEQ ID NO: 26: 

0 (i) SEQUENCE CHARACTERISTICS: 

J3 2 0 (A) LENGTH: 172 amino acids 

Oy (B) TYPE: amino acid 

C8 (C) STRANDEDNESS : single 

SO (D) TOPOLOGY: linear 

ru 

(ii) MOLECULE TYPE: protein 
yj 25 (v) FRAGMENT TYPE: internal 

f% (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



30 



35 



40 



45 



Met Leu 


Thr 


Val 


Gin 


Val 


Tyr 


Arg 


Thr 


Ser 


Gin 


Lys 


Cys 


Ser 


Ser Ser 


1 






5 










10 










15 


Lys His 


Val 


Val 


Glu 


Val 


Leu 


Leu 


Asp 


Pro 


Leu 


Gly 


Thr 


Ser 


Phe Cys 






20 










25 










30 




Ser Leu 


Leu 


Pro 


Pro 


Pro 


Leu 


Leu 


Leu 


Tyr 


Glu 


Thr 


Asp 


Arg 


Gly Val 




35 










40 










45 






Asp Gin 


Gin 


Asp 


Gly 


Lys 


Asn 


His 


Thr 


Gin 


Ser 


Pro 


Gly Cys 


Glu Glu 


50 










55 










60 








Ser Asp 


Ala 


Gly 


Lys 


Glu 


Glu 


Leu 


Met 


Lys 


Met 


Arg 


Leu 


Pro 


Thr Arg 


65 








70 










75 








80 


Ser Asp 


Met 


He 


Cys 


Gly 


Tyr 


Ala 


Cys 


Leu 


Lys 


Gly 


Asn 


Ala 


Ala Met 








85 










90 










95 


Arg Asn 


Thr 


Lys 


Arg 


Gly 


Ser 


Trp 


Tyr 


He 


Glu 


Ala 


Leu 


Thr 


Gin Val 






100 










105 










110 




Phe Ser 


Glu 


Arg 


Ala 


Xaa 


Cys 


Asp 


Met 


His 


Val 


Ala 


Asp 


Met 


Leu Val 




115 










120 










125 






Lys Val 


Asn 


Ala 


Leu 


He 


Lys 


Glu 


Arg 


Glu 


Gly 


Tyr 


Ala 


Pro 


Gly Thr 


130 










135 










140 








Glu Phe 


His 


Arg 


Cys 


Lys 


Glu 


Met 


Ser 


Glu 


Tyr 


Cys 


Ser 


Thr 


Leu Cys 
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145 150 155 160 

Gin Gin Leu Tyr Leu Phe Pro Gly Tyr Pro Pro Thr 
165 170 

(2) INFORMATION FOR SEQ ID NO: 27: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Leu Thr Val Gin Val Tyr Arg Thr Ser Gin Lys Cys Ser Ser Ser 
15 10 15 

15 Lys His Val Val Glu Val Leu Leu Asp Pro Leu Gly Thr Ser Phe Cys 
_ 20 25 30 

0 Ser Leu Leu Pro Pro Pro Leu Leu Leu Tyr Glu Thr Asp Arg Gly Val 
MP 35 40 45 

G? Asp Gin Gin Asp Gly Lys Asn His Thr Gin Ser Pro Gly Cys Glu Glu 

ffl 20 50 55 60 

03 Ser Asp Ala Gly Lys Glu Glu Leu Met Lys Met Arg Leu Pro Thr Arg 

fU 65 70 75 80 

=C Ser Asp Met lie Cys Gly Tyr Ala Cys Leu Lys Gly Asn Ala Ala Met 

ty 85 90 95 

s 25 Arg Asn Thr Lys Arg Gly Ser Trp Tyr lie Glu Ala Leu Thr Gin Val 
p 100 105 110 

01 Phe Ser Glu Arg Val Xaa Cys Asp Met His Val Ala Asp Met Leu Val 
fSJ 115 120 125 

flj Lys Val Asn Ala Leu lie Lys Glu Arg Glu Gly Tyr Ala Pro Gly Thr 

p 30 130 135 140 

y. Glu Phe His Arg Cys Lys Glu Met Ser Glu Tyr Cys Ser Thr Leu Cys 

145 150 155 160 

Gin Gin Leu Tyr Leu Phe Pro Gly Tyr Pro Pro Thr 
165 170 
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